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Induction of anti-tumor cytotoxic T lymphocytes in humans using 
peptide epitopes found by computer based algorithms for vaccination 

Description 

This invention relates to a method for providing, identifying or/and 
optimizing peptides which induce cytotoxic T-lymphocytes and to the uses 
of the thus obtained peptides, in particular, for vaccination. 

In particular, this invention relates to a method for predicting and 
optimizing peptides and peptidomimetics, based on the application of 
pattern recognition technologies such as, for example, artificial neural 
networks, in combination with a selection for the highest degree of 
conservation, in particular, phylogenetic conservation and optimization 
through amino acid exchange at the anchor positions of the MHC-binding 
peptides, and the use of the identified amino acid sequences in a peptide 
pool, e.g. together with additional helper antigens as co-stimulators for 
vaccination. 

The present invention further relates to compositions and methods for the 
treatment of cancer and the treatment or prevention of viral infections. The 
invention, in particular, provides peptides based on a 9 residue epitope 
derived from tumor-associated or viral antigens. The peptides induce 
cytotoxic T cells that destroy tumor cells and virus-infected cell. 

Further, this invention relates to computer-assisted analysis of biological 
molecules, particularly of biologically active peptides and peptide mimetics, 
and the prediction of their biological and pharmacological potencies. 
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Vaccines against tumors or viruses are based on specific antigens, in 
particular, on weakly immunogen-specific antigens, admixed to adjuvants 
in order to elicit, restore or augment immune responses against tumor cells, 
e.g. residual or metastatic tumor cells, or virus-infected cells. Cellular 
cytotoxicity is considered to play a major role in the elimination of tumor 
cells or virus-infected cells. Activation of cellular cytoxicity within an 
organism requires at least three synergistic signals: Epitopes derived from 
tumor-specific antigens presented by MHC class I molecules (HLA 
restriction), co-stimulatory signals provided by cell surface molecules of 
antigen-presenting cells (APCs), e.g. B-7.1 and B-7.2, and differentiation 
and propagation signals of cytokines. 

To activate cellular cytotoxicity it is therefore of great interest to find 
and/or provide pertinent HLA-restricted epitopes, especially also in view of 
the widespread occurrence of cancer and viral diseases. Therefore, it was 
an object of the invention to provide peptides which induce cytotoxic T- 
lymphocytes. 

According to the invention this object is achieved by a method for 
providing, identifying or/and optimizing peptides which induce cytotoxic T- 
lymphocytes, comprising the steps: 

(a) selecting one or more antigenic proteins, 

(b) selecting conserved regions within the protein sequence of 
the one or more antigenic proteins, and 

(c) identifying CD8+ T-cell epitopes within the protein sequence 
of the one or more antigenic proteins, preferably within the 
phylogenetically conserved regions. 

According to the method of the invention one or more antigenic proteins 
are selected in a first step. In particular, relevant antigenic proteins for 
various cancers or viruses are taken. The selection can be performed, for 
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example, by the man skilled in the art referring to literature or references 
describing antigenic proteins associated with cancers and viruses. 

In a second step, conserved regions within the protein sequence of one or 
more antigenic proteins are determined. The determination of conserved 
regions can be effected, for example, by comparison with other proteins, 
e.g. proteins stored in a database. In step (b) according to the invention 
conserved regions, i.e. regions which are subject only to minor changes 
during evolution, are determined. The selection of conserved regions, in 
particular, has the advantage that a high response rate is achieved in 
subsequent use of the peptides for inducing cytotoxic T-lymphocytes, and 
high effectiveness against the cancer cells and viruses to be attacked. In 
contrast to highly variable regions, conserved regions change only slightly 
and, thus, represent an excellent target for combatting cancer cells or 
viruses. It is especially preferable to select phylogenetically conserved 
regions within the protein sequences of the one or more antigenic protein. 

In a further step according to the invention CD8+ T-cell epitopes are 
identified within the protein sequence of the one or more antigenic proteins 
and preferably within the conserved regions, in particular, within the 
phylogenetically conserved regions. Determination of CD8 + T-cell epitopes 
can be effected by means of pattern recognition technologies and, 
especially by using an artificial neural network (ANN). Artificial intelligence 
and pattern recognition methods have been proven to be powerful tools in 
the bioinformatics field. For example, an artificial neural network (ANN) has 
been successfully applied to predict mitochondrial precursor cleavage sites 
(G.Schneider, P.Wrede, J.Mol.EvoI.36, 586 (1993) and membrane- 
spanning amin acid sequences (R.Lohmann, G.Schneider, D.Behrens and 
P.Wrede, Protein Science 3, 1597 (1994); M.Milik and J.Skolnick, in: 
"Proceedings of Fourth Annual Conference on Evolutionary Programming", 
MIT Press, La Jolla (1995)). 
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However, the identification of CD8+ T-cell epitopes or the prediction of 
MHC-I binding can be done by any technology available to the man skilled 
in the art. In particular, pattern recognition technologies can be applied. 
Preferably, however, an artificial neural network is used, since an ANN 
allows for prediction of MHC-I binding peptides with high accuracy. 
Particularly preferred an ANN is used which has been trained with an 
evolutionary algorithm. 

In a preferred and advantageous embodiment, the method according to the 
invention further comprises the step: 

(d) optimizing the identified CD8+ T-cell epitopes by exchanging one or 
more amino acids. 

Preferably, the amino acids are exchanged in the anchor positions of the 
epitopes, in particular, in the anchor residues of the MHC-1 binding 
peptides. Particularly preferred, said optimizing step is performed prior to 
the step of identifying CD8+ T-cell epitopes. According to the invention 
modified epitopes, too, are thus tested for their binding efficacy, as a result 
of which new effective peptides can be found. 

Optimization of the CD8+ T-cell epitopes is preferably effected by 
exchanging the amino acid present by another amino acid at one or more 
positions of the peptides. Said exchange can be effected randomly and at 
arbitrary positions. It is preferred, however, to first determine anchor 
positions and then exchange the amino acids present at said anchor 
positions. Preferably amino acids are taken in exchange which are known 
to increase binding to MHC-I at these anchor positions. 

By means of the method of the invention, in particular, peptides having a 
length of from 4-30, more preferably from 5-20, still more preferably of at 
least 6, at least 7, at least 8 or at least 9 amino acids, and up to 15, 14, 
13, 12, 1 1 or 10 amino acids are obtained. It is particularly preferred to 
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apply the invention to peptides having a length of 8, 9 or 1 0 amino acids, 
especially 9 amino acids. 

The term peptide as used herein also includes peptide mimetics which 
contain one or more non-naturally occurring amino acid, e.g. homoarginine, 
ornithine, etc. 

Selection of suitable peptides which induce cytotoxic T-iymphocytes can 
be effected by means of the above-described procedural steps, in 
particular, by selecting the respective best candidates of each procedural 
step, e.g. the best 50%, the best 30% or the best 10%. In addition, it is 
possible to incorporate filtering steps, by means of which particular 
peptides are selected and picked out as preferred or disposed of. 

According to the invention the predicted identified or optimized epitope 
peptides can be verified by in vitro or in vivo tests, especially by in vitro 
tests. 

The peptides obtained according to the invention, finally, can be used as 
pharmaceuticals, especially as a vaccine. In particular, tumors and virus 
infections can be treated or prevented successfully by means of the 
peptides obtained according to the invention. 

Therefore, the invention further relates to a pharmaceutical composition 
comprising one or more peptides obtainable by the method described 
above. This pharmaceutical composition can comprise further adjuvants, 
co-factors and/or co-stimulating agents, e.g. recall antigens as adjuvants 
for CD4* T-cell stimulation and for induction of co-stimulation for peptide 
and disease-specific CD8* cytotoxic T-cells. Particularly preferred, the 
pharmaceutical^ composition is a vaccine, in particular, a vaccine for the 
treatment and/or prevention of cancer or viral infections. The 
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pharmaceutical composition can be in any suitable administration form, 
with intracutaneous and parenteral administration being preferred. 

An important and most preferred aspect of the invention is the combination 
of methods to identify peptides and the subsequent use of the peptides 
found as pharmaceutical composition, in particular, for vaccination. 
Therefore, a most preferred embodiment of the invention is a method for 
providing a pharmaceutical composition for the induction of cytotoxic T- 
lymphocytes cmoprising: 

(a) providing one or more peptides which induce cytotoxic T- 
lymphocytes according to the method described above, and 

(b) using the one or more peptides for the manufacture of a 
pharmaceutical composition. 

The invention allows, in a unique manner, to combine these two steps. In 
particular, the invention allows to actually provide pharmaceuticals, starting 
out from computer-based predictions. 

The invention further relates to the peptides discovered by means of the 
inventive method, in particular, as shown in Tables 1, 2, 3 and 4 below, as 
well as to pharmaceutical compositions containing one or nnore^o £ these 
peptides or other^pe ptides discovered hy meansi-of the method of the 
invention, in particular, at least 2, at least 3, at least 4, at least 5, at least 
10 or at least 20 and up to 100, preferably up to 90, up to 80, up to 70, 
up to 60 or up to 50 of such peptides. 

Further peptide sequences of the invention are as shown in the following. 
In these sequences the amino acid at positions 2, 6 or/and 9 each 
independently can be replaced by V, L, I or/and M. 
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catdjzuxnan 

1.000000 
0.999313 
1.000000 
0.997769 
0.989067 
0.999877 
0.999934 

creb^human 

0.916947 
0.998294 
0.989 f 879 
0.997264 
0.999142 
0.999527 
0.999975 
0.999923 

cta.gr iixunan 

0.999339 
0.999982 
0.999974 
0.999991 
0.998567 
0.999960 



YIiSQDTVSV 
KLVDQNXFS 
LVDQNIFSF 
DQNIFSFYTi 
VTRKMTWQV 
QVHLDQVEV 
HTtT)QVSVAS 



HiNDLSSDA 

THiQYAQTT 
DVQTYQXRT 
AARKREVRXi 
AVXxSNQNKT 
VLENQNKTL 
TXjIKELKAIj 



EIAKRSLAQ 
VLIiKEFTVS 

XLTXHIiTAA 
TZKIiTAADH 
AADHRQLQIi 



150 
222 
223 
225 
264 
271 
273 



137 
219 
220 
248 
282 
316 
317 
324 



•158 
-23 0 
-231 
-233 
-272 
-279 
-281 



-145 
-227 
-228 
-256 
-290 
-324 
-325 
-332 



103-111 
121-129 

131- 139 

132- 140 
134-142 
13 9-147 



earb2_h.uman 

0.999709 
0 .999802 
0.999996 
0.999996 
0.974558 
0.998018 
0.611272 
0.999929 
0-996131 
0.947400 
0.999993 



SFLQDXQEV 
LXAHNQVRQ 
QLFEDNYAL 
LFEDNYAIiA 
QLRSLTEZL 
TXLWKDXFH 
IIiWKDXFKK 
DXFHKNNQL 
KNNQLAIiTL 
NNQIAIiTLI 
QLALTL.IDT 



72-80 
85-93 

106- 114 

107- 115 
141-149 

166- 174 

167- 175 
171-179 

175- 183 

176- 184 
178-186 
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0.999957 


IiXIJlJNiwrLfi 


183 -191 


0.999927 


ALrVT X-Nl u 1 


270-27 8 


0.999967 




•3*71 -?7<? 


0.993428 


HLREVRAVT 


I/O w *3 c-7 


0.834140 


AVTSANIQE 




0.99973 6 


VTSANTQEF 


•5C^ "3 C A 
jjo -J &4 


0.999164 


QVFKTLEEI 


398-400 


0.922432 


VFETLEEIT 


399-407 


0.99.9993 


SVFQNLQVZ 


423-431 


0.999985 


AIiIHHNTHL 


466-474 


0.704970 


LTSIISAW 


£S\-^59 


0.999930 


IiXKRKQQKI 


674-682 


0.990487 


RIiLQETEI»V 


689-697 


0.999361 


ETEIiRXVICV 


717-725 


0.998516 


AXKVIiSSOT 


751-759 


0.891765 


LTSTVQLVT 


790-798 


0.999969 


STVQLVTQIi 


792-800 


0.999802 


YI*EDVRLVE 


835-843 


0.999964 


KLVHKDliAA 


840-848 


0.999992 


DIiAAENVIiV 


845-853 


0.999993 


T.T.m DETEY 


869-877 


0.999885 


DIDETEYH& 


871-879 


0.999956 


SILRHEFTH 


893-901 


U • 27 i? / 6J O 




894-902 


0 .999221 


RFTHQSDW 


898-906 


U . D o *± J *C O 




900-908 


0 .999962 


RFESLVSEF 


968-976 


0.999439 


FWXQNEDL 


985-994 


0.990193 


DLVDAEHYXj 


10I6 r i024 


0 .999995 


LVDAEEYLV 


101-7-1025 


gp 1 0 0_immaii 






0.986531 


QVXWVKNTI 


101-109 


1 .000000 


VTWNNTXI 


102-110 


0 .931766 


SWSQKRSTV 


142-150 


0.999969 


SFVTVWKTW 


148-156 


0 .999897 


SVSVSQLRA 


216-224 


0.999997 


YLAEADLSY 


250-258 


0.990549 


VTAQWIrQA 


286-294 


0.999610 


TTAAQVTTT 


413-421 


0 .995788 


7VAOVTTTEW 

JTV*<itC " * — * ™ ~ 


415-423 


0.983375 


VTTTSWVET 


418-426 


0 . 999911 


SFSVTLDZV 


482-490 


0.999597 


NVSLADTNS 


568-576 


0.999BB2 


SIADTNSIA 


570-578 


0 .994579 


IifiDTNSXtAV 


571-579 


0-998051 


HSSSHWIxRL 


632-640 


mag el__nuraan 






:::::::::::::: 
0 .999970 


ALEAQQEAL 


15-23 


0.999915 


ZLESLFBAV 


93-101 


1. 000000 


VXT2DCVADL 


101-109 


0.927034 


ASESLQLVF 


147-155 


0 .978865 


KLLTQDLVQ 


237-245 


0-999998 


liVQEKYLEY 


243-251 


0-99G432 


IAETSYVKV 


271-279 


0.999888 


YVKVIiEYVZ 


276-284 


0.999949 


KVLEYVTKV 


278-286 


0 .985293 


YVXKVSARV 


-282-290 


0.999463 


KVSARVRF? 


285-293 



WO 02/072627 



- 9 



PCT/EP02/02666 



mage2_h.uman 

::::::::•'•* : : 

0.9513 25 

0.951515 

0.338975 

Q.*999958 

0.99994ff 

0.999920 

1.000000 

0.952279 

0-999993 

0.999998 

0.999988 

0.989867 

0.999985 

mag e3 ^jb-nmnn 

ZZZZZmZZZZZ . 

0.352244 
0.999920 
X. 000000 
0.999951 
0.994726 
1.000000 
0.952279 
0.999930 
1.000000 
0.989063 
1.000000 
0.999532 
0.999378 
0.999995 
0 .999978 

mage4_iLumazi 

0.998535 
0.999985 
0.997845 
0.993083 
0.982555 
0.991132 
0,999989 
0.995432 
0.999951 
0.998258 

mage5_hi2m£ii 

0.999973 
0.999964 
1.000000 
0.999983 
0.997569 
1-000000 

mage 6 immajo. 

0.962244 
0.999920 
1.000000 
0.999978 
1-000000 



ATEEQQTAS 
QTASSSSTL 
SFSTTINYT 
STTHT2TLW 
TXNYTLWRQ 
DLESEFQAA 
LVHFIlLLKY 
HFLLLKTOA 
VIFSKASEY 
LVQENTLEY 
LIETSYVKV 
YVKVIiHHTL 
KVU3HTL2CE 



AASSSSTLV 
DIiESEFQAA 

KVAELVHKL 
VAEIiVEFIiIj 
LVHFTiTiTiCT 
HFTiTiTiKTOR. 
VXFSKASSS 
IFSKASSSL 
ASSSLQLVF 

KLLTQHFVQ 
LLTQHFVQE 
FVQENYIiEY 
IiVETSYVKV 



TTEEQEAAV 
AliSKECVTJEL 
KVDEIiAHFL 

KLIiTQDWQ 
IiLTQDWQE 
WVQENYIiEY 
IiAETSTTSV 
KVTiRHWRV 
KWRVNARV 



AZDF2XWRQ 
DFTLWRQSI 

KVMIiIHFL 
VBDLIHFLL 
LZHFIiLLKY 



32-40 

37-45 

70-78 ' 

72-80 ' 

74-82 

100-108 

116-124- 

118-125 

149-157 

250-258 

278-285 

283-291 

285-293 



38-45 

100-108 

108-116 

112- 120 

113- 121 
116-124 
118-126 

149- 157 

150- 158 
154-152 
220-228 

244- 252 

245- 253 
250-258 
278-286 



32-40 
109-117 
113-121 
113-127 

245- 253 

246- 254 
251-259 
279-287 
286-294 
290-298 



74-82 
76-84 
108-116 

112- 120 

113- 121 
116-124 



AASSSSTLV 
DLESEFQAA 
ALSRXVAXL 
KVAKLVHFIt 
LtVHFTiTiTtKY 



38-46 

100-108 

108-116 

112-120 

115-124 
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0 


a CT>7 Q 

. 3 ( 2 


HrTtTiTiKxRA 


118 


-125 


0 


o a o tr "5 n 


VXFSKASDS 


149 


-157 


0 




IFSKA5DSL 


ISO 


-15 8' 


0 




T\ Till 

ASDSLQLVr 


154 


-162 


1 


.000000 


KIwEklSVL 


220.- 


-228 


0 


.999880 


KLLTQYFVQ 


244-252 


0 


.976478 


LLTQYFVQE 


245- 


■253 


0 


,999996 




250- 


■258 


0. 


.999988 




278- 


-286 



mage 8_fauman 



0.998794 
0.999999 
1.000000 
. 0 .997295 
0.999966 
0.998206 
0.9999X5 



AASSSSTLI 3 8-46 

SL2VTDSTL 71-79 

ALDEXVAEIi 111-119 

VAZLVRPIili 116-124 

RFIiEFJOTQI 121-129 

SVXKNYKNH 141-149 
142-150 



0.999999 

0 . 999512 

1.000000 

0.999951 

0.994726 

0.9984,22 

0.995920 

0.999636 

0.999991 

0.982666 

0.991132 

0.999989 

0.902355 

• •••I*!!!!! 

magaA_imman 

••••••iri** 

0.997990 

0.959999 

0-999324 

0.999G23 
1. 000000 
0.999762 
0.997688 
0.997563 
0.982666 
0.991132 
0.999589 
0.999984 



• : • J :::::: : 

0.99 654,2 

0.999977 

1 . 000000 

0-999993 

1.000000 

0.999952 

0.999894 

0.999975 

0.989319 

0 .999988 



SISVYYTLW 
SVYYTLWSQ 
AIiICLKTAEIL 
KVAEIiVHFL 
VAELVHFLL 
LVHFIiLHKY 

EVXHEAIiSV 
XLLTQDWVQ 
LLTQDWQE 
KVQZNYLEY 



AVEED AS SS 
EUJEKVTDL 
KVTDLVQFL 
VTDLVQFLL 

XLHSVXKNY 
SVXKNYEDH 

KLLTQDWVQ 
IjLTQDWVQE 
WVQSNYIiET 
SLLKFLAKV 



AFFSSTLNV 

HiHDiCiUDL 

KXXDLVHLL 

IXDLVKLLL 

HLLLRTTHV 

SVXKNTEDY 

YVLVTSIiNL 

VLVTSLNLs"" 

LVTSIiNLSY 



68-76 
70-78 
107-115 

111- 119 

112- 12 0 
115-123 
117-125 
13 7-145 
218-226 

243- 251 

244- 252 
249-257 
280-288 



33-41 
133-141 

137- 145 

138- 146 
141-149 
160-168 

163- 171 

164- 172 

269- 277 

270- 278 
275-283 
310-318 



32-40 
39-47 
111-119 

115- 123 

116- 124 
121-129 
141-149 

179- 187 

180- 188 

181- 189 
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0.936562 
0.553551 
0.999599 
0.559671 



mageC__huxnaii 



RLLTQNWVQ 247-255 

LLTQNWVQE 24 8 -25 6 

WVQEKYLVY 253-261 

KVLETCANA 288-296' 



0.541417 


ETASSSSTL 


37-45 




0.999983 


TTNYTLWSQ 


74-82 




0.995919 


DLETSFQVA 


100-1*08 




1.000000 


LVHFTiTiTiKY 


116-124 




0 . 552275 


H Vf .T.T.'K'y^R 


118-126 




0 .999995 


SVTRHFQDF 






0.5557B8 


VIHNFQDFF 


13 9-147 




0 .999993 


VZFSKASEY 


145 -157 




1 . 000000 


M H M 1 iWl V JnJ 






0 .978865 


KLIiTODIiVD 


244-252 ■ 




0 .559993 


IiVOSNYT/RY 

J-J V ^ i ■ I » X JUCj X 


^ J w — O 




0 .559578 


JUi V Jm. LJ X v xw v 


27S-2S £ 
/ o — jl o a 




0 984092 




XQJ X _7 _L 




0 .5555 83 


K W ( .TTSJT .7*. k } 






mc3in2 ixunicLn 








0.955224 




33-41 


30 


0.911561 


SVKEHRKXY 


92-100 


6 


0.99BS0C 


VWNQQESS 


108-116 


7 


0.982795 


STSSRRRAX 


157-165 


15 


0 .999685 


AISETEENS 


164-172 


12 


0.512657 


EHKSDSlSIi; 


183-191 


11 


0.999287 


SXSI^FDSS " 


188-196 


12 


0.5555X3 


SIiSFDESLA 


130-198 


15 


0.558285 


SVSDQFSVE 


240-248 


7 


0.9559.60 


SVEFEVESL 


246-254 


20 


0.999996 


SIiDSEDYSL 


253-261 


25 


0.559888 


XXYSSQEDV 


403-411 


21 



*•••**!!•• 

0.555546 
0.999957 
0.542786 

p53_in2maji 

:::::::::: 

0.595372 

0.595971 

0.516063 

0.399334 

tyx2_humarL 



FtiSEIiTQQL 
EIiTQQIiAQA 
I*IAEKI*RXS 



ETFSDLWKL 
TFSDLWKUj 
NTFRHSVW 
ATiKLKDAQA 



18-26 
21-29 
82-90 



17- 25 

18- 26 
210-218 
347-355 



1. 000000 
0.953786 
.955593 
.955999 
.555576 
.994160 
.955575 
.977456 
0-955984 
1.000000 
1. 000000 



0, 
0. 
0. 
0 , 
0. 
0. 



VXRQNXHSL 
AIiDIAKKEV 
SVYDFFVWL 

rVTWHRYKXi 
VTWHRYHLIi 
TLXSRNSRF 
SXNSRFSSW 
SLDDYNHLV 
FFQNSTFSF 
SLHNLVHSF 



125 
144 
180 
185 
216 
217 
271- 
274- 
.288- 
335- 
367- 



-133 
-152 
-188 
-193 
■224 
225 
275 
282 
256 
347 
375 
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0.999237 
0.999957 
0.995790 
0.955026 
0.990634 
0.955259 



tyro_huitian 

0.999956 
0.999999 
1.000000 
0.999147 
0.999998 
0.999987 
0.987547 
0.999999 
0.995893 
0.999994 
0.999999 
0.999924 
0.997281 
0.999962 
0.998227 
0.927015 
0.979646 



IFWLHSFT 391-399 

WLHSFTDA 3 93-401 

VLHSFTDAI 394-402 

VTNEEIiFLT 439-447 

ELFLTSDQL 443-451 

HLSSKRYTE 509-517 



LLWSFQTSA 9-17 

RIiLVRRNIF 116-124 

LVRKNXFDIj 118-126 

KFFAYXiTIiA 133-141 

YLTLAKHTI 137-145 

TIAKHTISS 139-147 

AKHTISSDY 141-149 

DINXYDLFV 169-177 

FLLRWEQEI 214-222 

SFFSSWQXV 267-275 

XFLLHHAFV 385-393 

Xj^HHAFVDS 387-395 

AFVDSXFEQ 391-399 

FVDSIFEQW 392-400 

SIFEQWliRR 3 95-403 

YLEQASRXW 467-475 

ASRIWSWLL 471-479 



Further peptide sequences of the invention are as shown in the following. In 
these sequences the amino acids at positions 2, 6 or/and 9 each 
independently can be replaced by V, L, I or/and M. 
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Table 2 

Pos. Sequence 



modification 



Identity- scores Comments 



y BCL2_ HUMAN 

^ 154 RIVAFFEFI 
137 RFATWEEL 
V ^ 188 YliNRHLHTW 

CCEM_HUMAN 
25 RI*LLTASLL 



26 
27 
28 
108 



IiIiliTASLLT 
LLTASXiLTF 
IiTASLLTFW 
IIYSNASLL 



G -> I Pos 9 



P -> S Pos 4 



187 
127 
124 



203 
209 
210 
210 
183 



229 
188 
188 



237 
237 
236 
236 
229 



CD19_HUMAN 
427 EFYSNDSNL 
326 VLHKKKKRI 
302 AVTLAYLIF 
287 VLWHWIxLRT 



M -> I Pos 9 



35 
35 
31 
30 



44 
44 
41 
41 



CGD1_KUMAN 
63 SLRKXVATW 
92 YliDRFLSLI 
152 LVNKIiKWNL 



M 
E 



-> L Pos 2 
-> I Pos 9 



431 
491 
320 



709 
656 
630 



CTAG_HOMAN 
129 VLIiKEFTVS 



24 



24 



6,0 69,233 



iT Exp Med 2000 Feb 21; 191 (4 ): 625-30 : 15 AS epitope for MHC-II 
ERB2_HUMAN (Her-2 ) 

827 RLVKRDLAA 587 802 

6,037,135: 10 AA (RIiVERDIiAA R) ; SeQ-Xd 288 for HLA-A3.2 
6,075,122; identical sequence patented Seq ID 18 



832 DLAARNVLV l/L at p os. 9 often ( , 789 

6,075,122: identical sequence patented Seq ID 9 



885 



RFTHQSDVW 



MQC1_HUMAN 
1049 SFFFliSFHI 
113 9 RYNI/TISDV 
1061 QFNSSLEDI 



-> I Pos 9 



611 



42 
39 
44 



850 



817 



42 
39 
44 



TRSR_HU2tfAN 
271 TFAEKVANA 
413 VXAQRDAWI 
455 SIXFASWSA 
489 YXNIjDKAVI, 



-> I Pos 2+9 



219 
232 
251 
222 



287 
312 
332 
293 



TYR2_HUMAN 

188 SVYDFFVWL 111 147 

193 FVWI2HYYSV 124 148 

6,083,703: 10 Aft. peptide Sea-Id: 17; no activity seen in test 
6,132,980: s.o. 



Cancer R£S. X998;58(2iyAS95 



224 FVTWHRYHL 
282 SRNSRFSSW 
3 51 STFSFRKAL 



128 168 
111 146 
106 144 
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CATD HUMAN ^ n fi1 n 

Ice "tissnlwvi g. p -> I 2 ' 9 "J 

272 VTRKAYWQV 456 6Q2 

404 VFDRDNNRV „ , • ___,a 
Oogenetics 1996;43 (6) :392-7 18-mer as lxgand 

PM17_HUMAN 47 56 

258 YLAEADLSY 45 sg 

294 VTAQWLQA 56 
576 NVSLADTNS 

CREB_HUMAN 124 

141 SYRKILNDL Q4 
325 VLENQNKTL. 



P53.HUMAN 

25 ETFSDLWKIi J£ 281 



218 NTFEHSVW 
257 RIXLTIITL 
355 AliELKDAQA 

MXF_HUMAN 

26 FLSETiTQQL 



P -> I Pos 2 295 303 

195 223 



73 103 



MAG1_HUMAN 

117 LVHFLLLKY G -> H Pos 3 == MAG2 126 1« 

6, 037, 135 : seq-ID 1205 ; HLA-3 and 1 1 binding; no CTL response 

j Xmnxunol 1999 Sep 1; 163 (5) :2928-36 : 1 4 i-Jfcer with T-cell response 

13 6 XIiESVXKNY M -> I Pos 1 

129 KLVTKAEIIj M -> I Pos 8 

p -> L Pos 2 

155 ASESLQIiVF 

245 KLLTQDIiVQ 

251 LVQENYXiEY K -> N Pos 5 

279 LXETSYVKV A -> I Pos 2 





MAGA 


Ill 138 




MAGA 


130 150 




MAG 4 








112 135 






117 130 




MAG2 


119 137 




MAG2 


103 130 


-> 


clearly- 


claimed. 



Further peptide sequences of the invention are as shown in the following. In 
these sequences the amino acids at positions 2, 6 or/and 9 each 
independently can be replaced by V, L, I or/and M. 
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Table 3 



Protein (Swiss- 
Prot-fD) 


Peptide 
sequence 


Position in 
.the protein 


Note 


I 


VGR3_HUMAN 


DLAARNILL 


1037-1045 








TTQSDVWSF 


1092-1100 








VLLWEIFSL 


1102-1110 






VEGFJHUMAN 


TLVDIFQEY 


57-65 






CD34_HUMAN 


ILDFTEQDV 


272-280 






! 

i 


TLIALVTSI 


290-298 


at pos9: G - 
>I 


i 

i 

i 
i 


i 
i 


TIQATSRNl 


364-372 


at pos9: G - 
>l 


! 
i 

i 


ETS1 HUMAN 


QLWQFLLEL 


336-344 




l 

! 

1 ' 


PEC1 HUMAN 


VIVNNKEKT 

V 1 V 1 »l «i \Wl V 1 


111-119 




i 




I1IQKDKAI 


270-278 




\ 

i 

r 




SIWNITEL 


316-324 




' 1 


MDM2_HUMAN 


SVKEHRKIY 


92-100 






MM01_HUMAN 


HLTYRIENY 


113-121 








AFQLWSNVT, 


137-145 








LHRVAAHEL 


212-220 







Further peptide sequences of the invention are as shown in the following,! 
in these sequences the amino acid at positions 2, 6 or/and 9 each 
independently can be replaced by V, L, ! or/and M. 
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Table 4 









conservation 


conservation 


-ANN score 


position 


Sequence 


Filter 


score 


score ' 


rrp2 
442-450 




1 


253 


272 
271 
268 
268 
268 
268 
267 


0.732 
0.873 
0.753 
0.973 
0.948 
0.980 
0.900 


659-667 

WWW JW 1 




1 


262 


510-518 


ii ft,*" | » | * . » 


1 


264 


701 -709 




1 


247 


417-425 
420-428 
638-646 


I/EDSIWXEL 
STWXEIiDEI 
KTLIAXSVF 


1 
1 
1 


230 
232 
237 


rrp3 
546-554 




1 


306 


327 
326 
325 
325 
324 
324 


0.982 
0.B60 
0.502 
0.992 
0.965 
0.992 


736-744 


XRKRNSSHi 


1 


274 


496-504 


VSXDBJXHV 


1 


302 


226-234 


SVYXEVTiHIj 


1 


303 


19-27 
544-552 


XLiTKTTVDH 
SVliVNTYQW 


1 
1 


306 
304 



hema 
51-59 
385-393 
435-443 
463-471 
245-253 
447-455 
382-390 
380-388 



EVTNATETjV 


1 


679 


STQAJlIDQI 


1 


767 


DLWSYNAEIi 


1 


720 




1 


668 




1 


656 


LENQHTXDL 


1 


715 


DIiKSTQAAI 


1 


755 




1 


748 



825 
618 
817 
815 
815 
•B10 
800 
800' 



0.618 
0.708 
0.985 
0.925 
0.969 
0.933 
0.837 
0.741 



vmtl 
153-181 
180-188 
232-240 
102-110 



QX&DSQHRS 1 1 55 

VIASTOZLKA 1 "162 

DTfTfKWIiQAY 1 155 

KLKREITFH 1 149 



179 
177 
171 
171 



0.738 
0.990 
0.953 
0.555 



vmt2 
35-43 
83-91 
39-47 



XJjHLHiWXL 1 9 
AVDADDSHF 1 129 
IIiWTIiDHIiF 1 24 



143 
142 
142 



0.996 
0.989 
0.973 



nram 




1 


217-225 




436-446 


WTSNSIWF 


1 


437-445 


WWTSNSrW 


1 


435-443 




1 


389-397 


JCLQINRQVX 


1 


222-230 


UiRTQESEC 


0 


02-10 


NPNQKZITI 


0 



380 482 0.995 

309 436 0.967 

305 416 0.895 

287 406 0.961 

245 356 0.984 

473 492 0.993 

416 429 0.949 



2^6 SFTVlLTVr 1 0* 93 °" 8 

03-11 NATrNYTIW 1 96 96 0.913 
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Particularly preferred are peptides VTAQVVLQA, VLAQVVLQL, 
LVHFLLLKY, LLHFLLLKL, FVWLHYYSV or FLWLHYYSL, which showed 
particularly high activity in step (b) as well as variants generated by AA 
exchange at position 2, 6 and/or 9, e.g. by V, L, I or M. 

The invention further relates to the use of the peptides found by the 
method of the invention for the production of a pharmaceutical for the 
induction of cytotoxic T-lymphocytes, in particular, for the prevention, 
treatment or diagnosis of cancer or viral infections. 

The invention and the individual procedural steps will be explained in detail 
below. 

HLA-restricted specific epitopes recognized by cytotoxic T cells are 
peptides of defined sequences of amino acids and can be characterized 
with artificial intelligence and pattern recognition methods in combination 
with additional filters and optimization steps described herein. The 
predicted epitope peptides can be verified with biological assays for tumor 
or virus antigen-specific T cell activities using peripheral white blood cells 
of patients as source for the specific T cells. A composition of 
HILA-restrictedjspecific a nti genic peptides (T -1_0 0) for^ Bartiaulai-vir-us or- 
tumor together with adjuvants as CD4+ helper T cell activators can be 
used for effective vaccination. 

A number of HLA-restricted tumor-specific epitopes and antigenic peptides 
for various cancers and viruses detected with the method of this invention 
is attached in the Tables. 

Procedure: 

a) Prediction of MHC-I specific epitopes 

Generation of a prediction tool for MHC-I binding and/or T-cell 
activation. This can be done by using any state of the art 
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technology for structure activity relationship (SAR) model 
generation, like ANN's, support vector machines (SVM's), SIMCA P, 
partial least squares projection to latent structures (PLS) etc.. As the 
basis for the application of these technologies a representative data 
set of peptides is used. This dataset, e.g., consists of peptides, 
known to bind to a given MHC-I molecule, e.g. those stored within 
the SYFPEITH database (Hans-Georg Rammensee, Jutta Bachmann, 
Niels Nikolaus Emmerich, Oskar Alexander Bachor, Stefan 
Stevanovic: SYFPEITHI: database for MHC ligands and peptide 
motifs. Immunogenetics (1999) 50: 213-219) and peptides, that do 
not bind. Due to the fact, that there is only limited data on 
experimentally proven not-binding peptides a set of randomly 
generated peptides can be used for model generation, e.g. all 
epitopes, that can be generated out of the p53 protein. In this 
particular case ANN's were trained for HLA-0201; HLA-0101; 
HLA-1 1 01 , based on the epitopes given in SYFPEITH database using 
an evolutionary algorithm for optimization of weights and biases 
within the neural network. The criteria for using a generated SAR 
model for epitope prediction is the prediction quality of said model 
on a test dataset, that has not been used for training. The neural 
networks used within the next steps of this inventions were able to 
correctly assign almost all test data to the corresponding class 
(binding, not-binding). 

Selection of the relevant antigenic proteins for various cancers and 
viruses. 

This is done according to current state of the art technology and 
knowledge. The following criteria can be used for selection: 

Proteins, described in literature as source of tumor 

associated antigens 

Proteins, involved in apoptotic processes, e.g. p53 
Proteins, belonging to tumor testis antigens and embryonic 
antigens, e.g. MAGE, BAGE, GAGE, CEA, AFP 
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Proteins, that are expressed in specific tissues, e.g. 
tyrosinase 

A procedure defining the degree of conservation for each potential 
epitope within the protein sequence, in particular, a procedure 
selecting (phylogenetically) conserved regions within a protein 
sequence. 

This procedure consists of 3 steps: 

1 . Performing a similarity search against protein and/or nucleic 
acid data bases containing human and/or non-human 
sequences, e.g. by using BLAST (Altschul, Stephen F., 
Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, 
Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSl-BLAST: a new generation of protein 
database search programs", Nucleic Acids Res. 
25:3389-3402) FASTA or any other available tool. 

See example in figure 1 

2. Defining a similarity cutoff, e.g. when using BLASTP the 
"expect threshold" can be set to 1e-30. Only those proteins 
with a similarity higher then the selected cutoff are used to 
perform step 3. 

3. Calculating the degree of conservation for each potential 
epitope. For this, the complete sequence of the selected 
tumor antigen is chopped into overlapping 9-mers (8-mers, 
10-mers). For each of these epitopes a conservation score is 
calculated. This can be done by simply summing up the 
number of identical AA between the selected antigenic 
protein and the identified homologue proteins over all epitope 
positions. Alternatively substitution matrices, e.g. BLOSUM, 
PAM etc. (see. Altschul et.al.) can be used. 

An example is given in figure 2. 
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A procedure generating all possible peptide variants out of each 
epitope within the selected tumor antigen, by exchanging the natural 
amino acid at certain anchor residues by more preferred amino 
acids. In particular, an optimization step where" amino acids (AA) 
within the so-called anchor residues of the MHC-I binding peptides 
are being exchanged. This procedure consists of 3 steps: 

1. Based on the knowledge about known epitopes (Hans-Georg 
Ramrnensee, Jutta Bachmann, Niels Nikolaus Emmerich, 
Oskar Alexander Bachor, Stefan Stevanovic: SYFPEITHI: 
database for MHC ligands and peptide motifs. 
Immunogenetics (1999) 50: 213-219) or by using the so 
called "virtual alanine scan'' technology (see 
PCT/EP01/14808) or by using any other technology the so- 
called "anchor residues" are identified. These are the 
positions within the epitope, that are most important for 
binding to the given MHC receptor. 

2. Moreover, by applying the same technologies, those AA, that 
are most preferable in these anchor positions are identified, 
e.g. for HLA-0201 the anchor position are position 2 and 9 
with L , M, V and I (isoleucine) most preferred in the 
corresponding positions (according to Ramrnensee et ai.). 
These preferred AA can also belong to the group of 
non-natural AA. 

3. The last step comprises the in silico generation of all possible 
peptide variants, e.g. for each epitope there are 8 peptide 
variants in case of 2 anchor residues with 2 different 
preferred amino acids each. These peptides. are only virtually 
generated, so no peptide synthesis has to be applied at this 
stage of the process. When including non-natural AA so 
called peptidomimetics are generated. 

Evaluation of all potential epitopes generated within the previous 
steps by the SAR model, e.g. ANN's trained in step 1 . In particular, 
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prediction of CD8 + T-cell epitopes, e.g. within an ANN. According 
to the results of the prediction the epitopes are ranked. 
The selection (filtering) of epitopes out of the ranked list is 
preferably done according to the following criteria: 

1 . SAR model predict high MHC-I binding for the epitope, 
preferably the highest. 

2. The epitope is predicted to bind to more then one MHC-I 
molecule. 

3. The epitope has high conservation score, preferably the 
highest among all epitopes of a given tumor antigen. 

4. The epitope has the following properties: 

a. The epitope do not contain any of the following amino 
acids: P, M, C, G. 

b. The epitope does not contain four of the aliphatic 
amino acids (I; L;) in line, e.g ILLL is filtered out, but 
ILLFL is permitted. 

c. The epitope do not contain the sequences PEST in a 
line. 

b) Verification of the predicted epitope peptides 

Verification of the predicted epitope with synthetic peptides and 
assays for the cytolytic activity and anti-tumor or anti-virus efficacy 
of the epitope-specific T cells using peripheral white blood cells of 
patients as source of specific T cells. 
Those epitope selected according to part a) of the procedure are 
synthesized with standard procedures and tested in an in vitro assay, e.g. 
as described in PCT/DE99/001 75 and Kern F. et al. Nature Medicine. 
(1998) 4(8):975-8,T-cell epitope mapping by flow cytometry. Those 
epitopes, that cause a specific T cell reaction within this assay are further 
developed into step c). 
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c) Vaccination with predicted epitopes 

Generation of vaccines that consist of 1-100, preferably 2-90, more 
preferably 5-80 and most preferably 10-50 relevant peptides as 
identified by a) and/or b) and optionally specific recall antigens as 
adjuvants for CD4* T cell stimulation and for induction of 
co-stimulation for the peptide and disease-specific CD8* cytotoxic T 
cells (CTL) or with adjuvants, co-factors or general CD4* T-cell 
stimulation antigens for co-stimulation of CD8* CTLs. 
In principle the epitopes identified within step a and b can be used in 
several vaccination strategies and are as such not restricted to the 
one mentioned above. 

Vaccination, in particular, intracutaneous or parenteral vaccination in 
humans with the vaccine pool. 

There are two patents claiming the application of ANN for the prediction of 
MHC binding motifs of biologically active peptides and peptide mimetics 
(DE 198 26 442, WO 98/53407 C2). 

The method presented within this invention preferably combines the 
application of ANN with two additional steps: 

An optimization step where amino acids (AA) within the so-called 
anchor residues of the MHC-I binding peptides are being exchanged 
A procedure selecting conserved regions within a protein sequence. 

The optimization and the selection. procedure can apply knowledge and/or 
computer-based algorithms. 

This invention provides the following advantages in comparison to 
previously described methods for T-cell epitope prediction: 
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The epitopes yielding highest CTL response in most human 
individuals will be the least variable ones and therefore be of the 
highest pharmacological relevance. 

The specific optimization step will improve" the MHC-binding 
properties of the peptides without affecting the biological activity of 
the peptide. The application of this optimization procedure to all 
9-mers (8-mers, 10-mers) of a given tumor antigen allow the 
identification of previously not identified epitopes and mimitopes. 
Further, it is possible to bbtain biologically active peptides that differ 
from naturally occurring sequences. 

The parallel prediction of binding to several different MHC-I 
molecules allows the identification of epitopes, that have a 
significant higher application potential. 

The application of knowledge based filters (PEST sequences; non 
tolerated amino acids etc.) increase the probability of biological 
effects and application potential. 

The usage of in vitro assays for the verification of the epitopes that, 
based on the biological reactivity of cytotoxic T cells of cancer or 
virus infection patients, ensures detection of disease-relevant 
specificities 

The usage of state of the art pattern recognition technologies in 
combination with the afore mentioned steps yield in a higher 
prediction accuracy. 

For vaccination, 1-100 peptides related to a particular virus or 
cancer, will be used as a vaccine. Additionally, specific co-factors, 
adjuvants and CD4 4- T-cell antigen for co-stimulation of CD8+ T- 
cells will be included. This can be applied intracutaneously, 
parenterally, etc. 

Fig. 1 schematically shows a similarity search, and 

Fig. 2 shows an example of calculation of conservation scores. 
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Examples 
Example 1 

The performance of the method of the invention will be explained in the 
following by way of an example. 

First, an antigenic protein is selected, e.g. from a database. In the case of 
this example, a protein having 509 amino acids is chosen as an antigenic 
protein. Said protein is is fragmented virtually (by computer) to give 500 
peptides having a length of 9 amino acids each. A conservation score is 
determined for each of these 9-mers. In the subsequent optional step 
anchor positions and preferred amino acids at these positions are 
determined. In the case of this example it is assumed that anchor positions 
are at positions 2 and 9 and 2 optimal amino acids each are described in 
the prior art for each position. This leads to 8 variants for each 9-mer, so 
a total of 4,500 epitopes are present (8 variants and 1 original). These 
epitopes are now tested as to whether they are CD8 4- T-cell epitopes by 
means of a pattern recognition technology, e.g. SAR and ANN, 
respectively. In particular, MHC binding capacity can be determined this 
way. 

Assuming it is found that 300 epitopes are effective, the conservation 
score of these 300 epitopes is now used to determine the best 100 
epitopes. 

Subsequently, a filter can be used which sorts out particular peptides, e.g. 
peptides containing proline (because of unfavorable folding) and peptides, 
in the case of which synthesis problems are to be expected. 

In this way the number of epitopes can be further reduced, e.g. to 50. 
These 50 epitopes can now be verified in an in vitro assay for their 
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activity. Part or all of the peptides verified as being active can then be 
pooled and used as a vaccine. 

Example 2 

In vitro verification of the T-cell activation functionality of peptides 
identified or optimized, respectively, according to the invention. 



Peptide sequence 


Source protein 


Frequencies reactive CD8+ T cells 


Melanoma 


Cutaneous T-cell 
lymphoma 


VTAQWLQA 


GP100 


0,08 


0,04 


VLAQWLQL 


GP100 optimized 


0,18 


0,12 


LVHFLLLKY 


MAGE 


0,99 


0,03 


LLHFLLLKL 


MAGE optimized 


1,10 


0,03 


FVWLHYYSV 


TYR2 


1,01 


0,01 


FLWLHYYSL 


TYR2 optimized 


0,82 


0,02 


Control 




0,10 . 


0,02 
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Claims 

1 . A method for providing, identifying or/and optimizing peptides which 
induce cytotoxic T-lymphocytes, comprising the steps: 

(a) selecting one or more antigenic proteins, 
<b) selecting conserved regions within the protein sequence of 
the one or more antigenic proteins, and 

(c) identifying CD8 + T-cell epitopes within the protein sequence 
of the one or more antigenic proteins. 

2. The method according to claim 1, further comprising the step: 

(d) optimizing the identified CD8+ T-cell epitopes by exchanging 
one or more amino acids at the anchor positions thereof. 

3. The method according to claim 2, wherein step (d) is performed 
prior to step (c). 

4. The method according to any of the preceding claims, wherein step 
(c) is performed using an artificial neural network. 

5. The method according to any of the preceding claims, wherein in 
step (a) one or more antigenic proteins for cancer or/and a virus are 
selected. 

6. The method according to any of claims 1 to 5, wherein peptides 
having 4 to 30 amino acids are obtained. 



7. 



The method according to any of the preceding claims, wherein an 
additional filtering step is applied. 
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8. The method according to any of claims 1 to 7, further comprising 
the step: 

verification of the activity of the identified or/and optimized 
peptides in vitro. 



9. Pharmaceutical composition comprising one or more peptides which 
induce cytotoxic T-lymphocytes obtainable according to the method 
of any of claims 1 to 8. 

10. The pharmaceutical composition according to claim 9, further 
comprising adjuvants, co-factors and/or co-stimulating agents. 

11. A method for providing a pharmaceutical composition for the 
induction of cytotoxic T-lymphocytes, comprising: 

(a) providing one or more peptides which induce cytotoxic T- 
lymphocytes according to the method of any of claims 1 to 8, 
and 

(b) using the one or more peptides for the manufacture of a 
pharmaceutical composition. 

1 2. Isolated peptide as depicted in any of Tables 1 , 2, 3 or 4, including 
the variants generated by AA exchange at positions 2, 6 and/or 9. 

13. Isolated peptide having the formula VTAQVVLQA, VLAQVVLQL, 
LVHFLLLKY, LLHFLLLKL, FVWLHYYSV or FLWLHYYSL, including 
the variants generated by AA exchange at positions 2, 6 and/or 9. 

14. Pharmaceutical composition comprising one or more peptides 
according to claim 12 or 13. 
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15. Use of a peptide according to claims 12 or 13 or obtainable 
according to the method of any of claims 1 to 8 for the manufacture 
of a pharmaceutical for the induction of cytotoxic T-Iymphocytes. 

1 6. Use according to claim 1 5 for the prevention, treatment or diagnosis 
of cancer or viral infections. 
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Figure 1 : 

Similarity search with selected tumor associated antigen using BLASTP against 
SWISS-PROT 



BLASTP 2.1.3 

Reference: „ . M _ _ 

Altschul, SteDhen F. , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang", Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:338 9-3402. 



Query= 



(385 letters) 



Database: Non-redundant SwissProt sequences 

96,4 69 sequences; 35,174,128 total letters 

Sequences producing significant alignments: 

CD34_HUMAN HEMATOPOIETIC PROGENITOR CE. . . 543 e-154 
CD34_CANFA HEMATOPOIETIC PROGENITOR CE... 359 8e-99 
CD34 MOUSE HEMATOPOIETIC PROGENITOR CE. . . 349 9e-96 



Score E 
(bits) Value 



1 ' 17 WTALCLLSLLPSGFMSLDNNGTATPELPTQGTFSNVSTNVSYQETTTPSTLGSTSLHPVS 76 

3183511 17 TTTPSTLGSTSL 76 

04QR215 17 ' F. .TNTETVI. P.TVPTSTEIM.A. , E . T . KR. AITLTPSGTTTLYS . . 76 

x • ; ; ; ' - ; ;i H.N.LTS. .T.TS. . .ISPS. P. .S.VE.NITSSIPGSTSHYLIY 71 



3182946 17 



136 



1 77 qhGNEATTNITETTVKFTSTSVITSVYGNTNSSVQSQTSVISTVFTTPANVSTPETTLKP 

77 NTNSSVQSQTSVISTVFTT 136 

[ !hv! ! ! .E- .LTP.TMNSSVQSQTSLAITVSFT.T.F. .SSV. .E. 13 6 
V. .G.P.GS.TPHTFSQPQTSPTGILPTTSDSI. .S.M.W.S 131 



3183511 
2498215 77 
3182946 72 



. DSSGT . AT . S . 
.DSSKT.PA.S. .M.N. 



1 137 SLSPGN v — S DLS TT ST SLAT S PTKFYT S S S P ILS D IKAEIKCS GIREVKLTQG 188 

31835^1 137 . — SDLSTTSTSLATSPTKPYTSSSP 168 

2498^15 137 . . L. . . GSDPPYN— STSLVTSPTEYYTSLSPTPSRNDTP .T . . G VK N. 

3182946 132 ..PSI. . SDYSPNNSSFEMTSPTEPYAYTSSSAF . A. . G R-A. 



194 
185 



189 ICLEQNKTSS 



CAE FKKDRGEGLARVLCGEEQADAD AGAQVCS LLLAQS E VRPQCLLLVLA 



3183511 189 
2498215 195 
3182946 186 



. L.E . . 
. LSEA. 



.ED. 
.E. . 



. . .NE.K.TQ. . 
. EK. .D.XQX. 



. — .KEP.E. 
.EK.E.E, . . 



.M. 



248 
248 
252 
245 



1 249 NRTEISSKLQLMKKHQSDLKKLGILDFTEQDVASHQSYSQKTLIALVTSGALLAVLGITG 308 

3183511 249 1 * " ' 3 ° b 

2498215 253 .K..LF LR R G R I. . . . - . - 

3182946 246 . S . . LP E R QS.NK..IG R V...A..1. 



312 
305 



365 



3 309 YFLMNKRSWSPTGERLGEDPYYTENGGGQGYSSGPGTSPEAQGKASVNRGAQENGTGQAT 368 

3183511 309 GGGQGYSSGPGTS 368 

2498215 313 GGGQGYSSGPGVS P 372 

3182946 306 GGGQGYSSGPGAS . .T N.T 

1 369 SRNGHSARQHWADTEL 385 

3183511 369 385 

2498215 373 . , M 389 

3182946 366 382 



Database: Non-redundant SwissProt sequences 

Posted date: May 11, 2001 5:54 AM 
Number of letters in database: 35,174,128 
Number of sequences in database: 96,4 69 



Lambda K H 
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0.312 0.128 0.357 
Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 19900574 

Number of Sequences: 9 64 69 

Number of extensions: 647 916 

Number of successful extensions: 1005 

Number of sequences better than 10.0: 4 

Number of HSP's better than 10.0 without gapping: 3 

Number of HSP's successfully gapped in prelim test: 1 

Number of HSP's that attempted gapping in prelim test: 998 

Number of HSP's gapped (non-prelim) : 4 

length of query: 385 

length of database: 35, 17 4 , 128 

effective HSP length: 56 

effective length of query: 329 

effective length of database: 29,771,864 

effective search space: 9794943256 

effective search space used: 9794943256 

T: 11 

A: 40 

XI: 16 { 7.2 bits) 
X2: 38 (14.6 bits) 
X3: 64 (24.7 bits) 
SI: 42 (21.9 bits) 
S2: 66 (30.0 bits) 
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Figure 2: 

Calculation of 2 different conservation scores for all possible epitopes within position 25 - 78 of the 
query molecule CD34_HUMAN, when using BLASTP as shown in figure 1., 

CD34_HUMAN HEMATOPOIETIC PROGENITOR CE.... 543 e-154 
CD34_CANFA HEMATOPOIETIC PROGENITOR CE. . . 359 8e-99 
CD34 MOUSE HEMATOPOIETIC PROGENITOR CE. . . 349 9e-96 
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